Identifying Quora question pairs having the same intent
نویسندگان
چکیده
This paper presents a system which uses a combination of multiple text similarity measures of varying complexities to classify Quora question pairs as duplicate or different. The solution uses a support vector classifier model trained using the precomputed features ranging from longest common sub-string and sub sequences to word similarity based on lexical and semantic resources. The scope of this project is to tackle the short text similarity classification problem by applying Natural Language Processing techniques. The approach and methodologies used in this paper can be further extended to implement automatic short answer grading systems, essay grading system and textual entailment detection problems as well.
منابع مشابه
Siamese Neural Networks with Random Forest for detecting duplicate question pairs
Determining whether two given questions are semantically similar is a fairly challenging task given the different structures and forms that the questions can take. In this paper, we use Gated Recurrent Units(GRU) in combination with other highly used machine learning algorithms like Random Forest, Adaboost and SVM for the similarity prediction task on a dataset released by Quora, consisting of ...
متن کاملIdentifying Purchase Intent from Social Posts
In present times, social forums such as Quora and Yahoo! Answers constitute powerful media through which people discuss on a variety of topics and express their intentions and thoughts. Here they often reveal their potential intent to purchase ‘Purchase Intent’ (PI). A purchase intent is defined as a text expression showing a desire to purchase a product or a service in future. Extracting posts...
متن کاملAnalysis and Prediction of Question Topic Popularity in Community Q&A Sites: A Case Study of Quora
In the past few years, Quora a community-driven social platform for question and answering, has grown exponentially from a small community of users into one of the largest and reliable source of Q&A on the Internet. Quora has a built-in social structure integrated to its backbone; users can follow each other, follow question, topics etc. Apart from the social connections that Quora provides, it...
متن کاملDuplicate Question Pair Detection with Deep Learning
Determining whether two questions are asking the same thing can be challenging, as word choice and sentence structure can vary significantly. Traditional natural language processing techniques such as shingling have been found to have limited success in separating related question from duplicate questions. Using a dataset of 400,000 labeled question pairs provided by question-and-answer forum Q...
متن کاملWho is Authoritative? Understanding Reputation Mechanisms in Quora
As social Q&A sites gain popularity, it is important to understand how users judge the authoritativeness of users and content, build reputation, and identify and promote high quality content. We conducted a study of emerging social Q&A site Quora. First, we describe user activity on Quora by analyzing data across 60 question topics and 3917 users. Then we provide a rich understanding of issues ...
متن کامل